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Abstract — In this paper, we present an iterative soft- 
decision decoding algorithm for Reed-Solomon codes of- 
fering both complexity and performance advantages over 
previously known decoding algorithms. Our algorithm is 
a list decoding algorithm which combines two powerful 
soft decision decoding techniques which were previously 
regarded in the literature as competitive, namely, the 
Koetter-Vardy algebraic soft-decision decoding algorithm 
and belief -propagation based on adaptive parity check ma- 
trices, recently proposed by Jiang and Narayanan. Building 
on the Jiang-Narayanan algorithm, we present a belief- 
propagation based algorithm with a significant reduction 
in computational complexity. We introduce the concept of 
using a belief-propagation based decoder to enhance the 
soft-input information prior to decoding with an algebraic 
soft-decision decoder. Our algorithm can also be viewed 
as an interpolation multiplicity assignment scheme for 
algebraic soft-decision decoding of Reed-Solomon codes. 



I. Introduction 

Reed-Solomon (RS) codes [1] are among the most 
celebrated forward error correcting codes. The RS codes 
are currently used in a wide variety of applications, 
ranging from satellite communications to data storage 
systems. Reed-Solomon codes have been adopted as 
outer codes in the 3G wireless standard, CDMA2000 
high-rate broadcast packet data air interface [2], and 
are expected to be used as outer codes in concatenated 
coding schemes for future 4G wireless systems. 

Maximum likelihood (ML) decoding of linear codes, 
in general, and RS codes, in particular, is NP-hard [3], 
[4]. It remains an open problem to find polynomial- 
time decoding algorithms with near ML performance. 
A soft-decision ML decoding algorithm was proposed 
by Vardy and Be'ery [5]. Further modifications of this 
algorithm were also studied [6]. Guruswami and Sudan 
(GS) [7] [8] invented a polynomial-time list decoding 
algorithm for RS codes capable of correcting beyond half 
the minimum distance of the code. Koetter and Vardy 
(KV) [9] developed an algebraic soft-decision decoding 
(ASD) algorithm for RS codes based on a multiplicity 
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assignment scheme for the GS algorithm. Alternative 
ASD algorithms, such as the Gaussian approximation 
algorithm by Parvaresh and Vardy [10] and the algorithm 
by El-Khamy and McEliece based on the Chernoff 
bound[ll], [12], have better performance. 

Jiang and Narayanan (JN) developed an iterative al- 
gorithm based on belief propagation for soft decoding 
of RS codes [13], [14]. This algorithm compares fa- 
vorably with other soft decision decoding algorithms 
for RS codes and is a major step towards message 
passing decoding algorithms for RS codes. In the JN 
algorithm, belief propagation is run on an adapted parity 
check matrix where the columns in the parity-check 
matrix corresponding to the least reliable independent 
bits are reduced to an identity submatrix [13], [14]. The 
order statistics decoding algorithm by Fossorier and Lin 
[15] also sorts the received bits with respect to their 
reliabilities and reduces the columns in the generator 
matrix corresponding to the most reliable bits to an 
identity submatrix. This matrix is then used to generate 
(permuted) codewords using the most reliable bits. Other 
soft-decoding algorithms for RS codes include the gen- 
eralized minimum distance (GMD) decoding algorithm 
introduced by Forney [16], the Chase II algorithm [17], 
the combined Chase II-GMD algorithm [18] and succes- 
sive erasure-error decoding [19]. 

In this paper, we develop an algebraic soft-decision 
list decoding algorithm based on the idea that belief 
propagation-based algorithms could be deployed to im- 
prove the reliability of the symbols that is then utilized 
by an interpolation multiplicity assignment algorithm. 
Our algorithm combines the KV and the JN algorithms. 
An outline of the paper is as follows. Some prelimi- 
naries are given in section ITfl In section [HI] we briefly 
review algebraic soft-decoding algorithms, in general, 
and the KV algorithm, in particular. The JN algorithm 
is explained in the context of this paper in section IIVI 
Some modifications to the JN algorithm are introduced 
in section [V] One of the main contributions in this 
paper, the iterative algebraic soft-decision list decoding 
algorithm, is presented in section IVII Another main 
contribution, a low complexity algorithm based on the JN 
algorithm, is presented in section IVTll Some discussions 
as well as some numerical results are presented in section 
IVIIII Finally, we conclude the paper in section IIXI and 
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suggest future research directions. 

II. Preliminaries 

Throughout this paper, d = [do, d\, dk-i] will 
denote a k dimensional vector over F q where F q is 
the finite field of q elements. C will denote an (n, k) 
RS code. An (n, k) RS codeword u = [u ,ui, ..,u n -\] 
could be generated by evaluating the data polynomial 
D(x) = J2i=o ^ iX% at n elements of the field composing 
a set, called the support set of the code. This set is vital 
for the operation of the Guruswami-Sudan algorithm. Let 
a be a primitive element in F q . Since the polynomial 
U(x) = Y^i=o UiX% associated with the codeword u E C 
generated by polynomial evaluation has a, a 2 , .., a n ~ k 
as zeros [20], a valid parity check matrix for C is [21] 
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The redundancy of the code's binary image will be 
denoted by f where f = h — k and n = vnn and k = 
mk. The results in this paper assume that the binary 
image and the corresponding binary parity check matrix 
are of the form described here. Let p(x) be a primitive 
polynomial in i<2[x] and C be its companion matrix [22]. 
The companion matrix is an m x m binary matrix. Since 
the mapping a 1 <-> C l , {i = 0,1,2,..} induces a field 
isomorphism, an f x h binary parity check matrix H 
is obtained by replacing every element a 1 in the parity 
check matrix H by its corresponding mx m matrix C\ 
The binary image b, such that Hb T = 0, is obtained by 
representing each element Uj e i 7 ^™ with Uj = Uj t o + 
Uj^a + ... + Uj^n^ia" 1 ^ 1 where Uj t i € F 2 . 

An q x n array of real numbers will be denoted by 
W = [Wi(j3)], where i = 0, 1, ...,n - 1 and (3 e F q . If 
u is transmitted and the corresponding channel output is 
y, then we denote the a-posteriori probabilities Pr{u{ = 
/%} by IL(/3). 

III. Algebraic Soft Decoding 

An algebraic soft decoder makes use of the soft infor- 
mation available from the channel. Given the a-posteriori 
probability matrix LT, a multiplicity assignment algorithm 
generates angxn multiplicity matrix, M — [Mj (/?)], of 
non-negative integers. The interpolation cost of M is de- 
fined to be 1 \M\ 4 \ J^~o E^f, W) ^{(3) + I) 
and the score of u with respect to M is (it, M) = 
E"=o 1 Mi(ui). This multiplicity matrix is then passed 
to a (modified) GS algorithm consisting of two main 
steps [7], [23] 

1) Interpolation: Construct a bivariate polynomial, 
Q(x,y), of minimum (1, k — 1) weighted degree 
that passes through each of the points {T t , (3) 

'To prevent notational ambiguity, ||x||i will denote the magnitude 
of x. 



with multiplicity Mj(/3), where (3 <G F q and i = 
0,l,..,n- 1. 

2) Factorization: Find all linear factors (y — 
G(x))\Q(x,y) where G(x) is a polynomial of 
degree less than k. Each such polynomial G(x) 
is placed on the list. 

A solution to the interpolation problem exists if \M\ is 
strictly less than the number of monomials in Q such that 
Q is of minimal (1, k — 1) weighted degree Afc_i (|M|) 
[24]. A sufficient condition for a codeword u to be on 
the GS generated list is [7], [9], 



(u,M)>A k _ 1 (\M\), 



(2) 



where A v (7) 
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[11]. In case the cost tends to 
infinity, the sufficient condition is [9], [11] 
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In this paragraph, we briefly review well-known ASD 
algorithms. For more details, we refer the readers to 
the given references. The KV algorithm maximizes the 
mean of the score. A reduced complexity KV algorithm 
constructs the multiplicity matrix M as follows [9], [25] 



Mi(0)= l\Il t ((3)\, 



(4) 



where A > is a complexity parameter determined 
by \M\. For \M\ = 7, it can be shown that A = 
(-1 + ^l + 8j/n)/2. Other algorithms [10] and [11] 
minimize the error probability directly. The algorithm 
of [10] (Gauss) assumes a Gaussian distribution of the 
score, while that of [1 1] (Chernoff) minimizes a Chernoff 
bound on the error probability. The later appears to have 
the best performance. 

IV. Adaptive Belief Propagation 

Gallager devised an iterative algorithm for decoding 
his low-density parity check (LDPC) codes [26]. This 
algorithm was the first appearance in the literature of 
what we now call belief propagation (BP). Recall that 
H is the parity check matrix associated with the binary 
image of the RS code. It has r rows corresponding to 
the check nodes and n columns corresponding to the 
variable nodes (transmitted bits). Hij will denote the 
element in the i th row and j th column of H. Define the 
sets, J(t) 4 {j I H itj = 1} and 4 {1 | H id = 1}. 
Define Qij to be the log-likelihood ratio (LLR) of the 
jth symbol, Uj, given the information about all parity 
check nodes except node i and Rij to be the LLR that 
check node i is satisfied when Uj is fixed to and 1 
respectively. Given the vector A m of initial LLRs, the 
BP algorithm outputs the extrinsic LLRs A x as described 
below [27] [28]. 

Algorithm 1: Damped Log Belief Propagation (LBP) 
For all (i, j) such that H it j = 1: 
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Initialization: Qi t j = A™ 
DO 

Horizontal Step: 

/ l + n fce .7 W \ 3 tanh(g a ./2) \ 
°H 1 -n fce ,7 W \,tanh(g i , fc /2) y l 

= 2tanir 1 Yl tanh(g j!fc /2) (5) 
\ke,J(t)\j J 

Vertical Step: 

Qi,j = A™ + 9 22 
kei(j)\i 

While stopping criterion is not met. 

Extrinsic Information: AJ = Y^keitj) ^k,j- 
The factor 9 is termed the vertical step damping factor 
and < < 1. The magnitude of 9 is determined by 
our level of confidence about the extrinsic information. 
In our implementations, 9 is 0.5. Eq. [5] is specifically 
useful for fast hardware implementations where the tanh 
function will be quantized to a reasonable accuracy and 
implemented as a lookup table. In our implementation, 
damped LBP is run for a small number of iterations on 
a fixed parity check matrix, so the stopping criterion 
is the number of iterations. In case that only one LBP 
iteration is run on the parity check matrix, the vertical 
step is eliminated. 

Following we describe the JN algorithm [13], [14], 
which builds on the BP algorithm. In the JN algorithm, 
BP is run on the parity check matrix after reducing its 
independent columns corresponding to the least reliable 
bits to an identity submatrix. We will refer to such a class 
of algorithms, that adapt the parity check matrix before 
running BP, by adaptive belief propagation (ABP). 

Algorithm 2: The JN Algorithm 
Initialization: A p := A ch 
DO 

1) Sort A p in ascending order of magnitude and store 
the sorting index. The resulting vector of sorted 
LLRs is 

A in r a in \ in a in ] 

J\ — , iV 2 , ...,i\ nm \, 

IIArlli < \\K l +1 h for k = 1,2, ...,nm- 1 
and A ln = PA P where P defines a permutation 
matrix. 

2) Rearrange the columns of the binary parity check 
matrix H to form a new matrix Hp where the 
rearrangement is defined by the permutation P. 

3) Perform Gaussian elimination (GE) on the matrix 
Hp from left to right. GE will reduce the first in- 
dependent (n — k)m columns in Hp to an identity 
sub-matrix. The columns which are dependent on 
previously reduced columns will remain intact. Let 
this new matrix be Hp. 

4) Run log BP on the parity check matrix Hp with 
initial LLRs A 1 ™ for a maximum number of iter- 



ations Itp, and a vertical step damping factor 9. 
The log BP algorithm outputs extrinsic LLRs A x . 

5) Update the LLRs, A q = A in + a x A x and A p := 
p- 1 A q where < a x < 1 is called the ABP 
damping factor and P^ 1 is the inverse of P. 

6) Decode using A p as an input to the decoding 
algorithm D. 

While Stopping criterion not satisfied. 

The JN algorithm assumed that the decoder D is one 
of the following hard-decision decoders: 

• HD: Perform hard-decisions on the updated LLRs, 
u = (1 - sign(A p ))/2. If Hu T = 0, then a 
decoding success is signaled. 

• BM: Run a bounded minimum distance decoder 
such as the Berlekamp-Massey (BM) algorithm on 
the LLRs after hard-decisions. If the BM algorithm 
finds a codeword, a decoding success is signaled. 

The performance largely depends on the decoder D and 
the stopping criterion used. This is discussed in the 
following section. 

V. Modifications to the JN Algorithm 

The stopping criterion deployed in the JN algorithm 
is as follows [14]: 

• Stop if a decoding success is signaled by the 
decoder D or if the number of iterations is equal to 
the maximum number of iterations, N\. 

We propose a list-decoding stopping criterion in which 
a list of codewords is iteratively generated. The list- 
decoding stopping criterion is as follows 

• If a decoding success is signaled by the decoder 
D, add the decoded codeword to a global list of 
codewords. Stop if the number of iterations is equal 
to the maximum number of iterations, Ni. 

If more than one codeword is on the global list of 
codewords, then the list-decoder's output is the codeword 
which is at the minimum Euclidean distance from the 
received vector. Alternatively, one could only save the 
codeword with the largest conditional probability, given 
the received vector. This codeword would be the candi- 
date for the list decoder's output when the iteration loop 
terminates. 

The advantage of our proposed list-decoding stopping 
criterion over the stopping criterion in the JN algorithm 
is emphasized in the case of higher rate codes, where 
the decoder error probability is relatively high. Given a 
decoding algorithm D, the JN ABP algorithm may result 
in updating the received vector to lie in the decoding 
region of an erroneous codeword. However, running 
more iterations of the JN ABP algorithm may move 
the updated received vector into the decoding sphere 
of the transmitted codeword. The decoding algorithm D 
should also be run on the channel LLRs before any ABP 
iteration is carried out. If the decoder succeeds to find a 
codeword, it is added to the list. 

Jiang and Narayanan [13] proposed running N2 paral- 
lel decoders (outer iterations), each with the JN stopping 
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criterion and a maximum of N± inner iterations. Each 
one of these iVjj iterations (decoders) starts with a 
different random permutation of the sorted channel LLRs 
in the first inner iteration. The outputs of these N2 
decoders form a list of at most N2 codewords. If each 
of these N2 decoders succeeds to find a codeword, the 
closest codeword to the received vector is chosen. We 
also run N2 parallel decoders (outer iterations), each 
with the list-decoding stopping criterion, to form a global 
list of at most N1N2 codewords. We propose doing the 
initial sorting of the channel LLRs in a systematic way 
to ensure that most bits will have a chance of being 
in the identity sub-matrix of the adapted parity check 
matrix. The improved performance achieved by these 
restarts could be explained by reasoning that if a higher 
reliability bit is in error, then it has a higher chance of 
being corrected if its corresponding column in the parity 
check matrix is in the sparse identity submatrix. 

Let z = [n/N 2 \, then at the (j + l) th outer iteration, 
j > 0, the initial LLR vector at the first inner iteration 
is 

[Am \in \in \in \in \i n l 

l/^z+i) ••) Iy (j+i) z > J n i •••>-' v iz' 0'+i>+i' « J' 

(6) 

where A m is the vector of sorted channel LLRs. The 
columns of Hp will also be rearranged according to the 
same permuatation. If (j + l)z < r, then it is less likely 
that this initial permutation will introduce new columns 
into the identity submatrix other than those which existed 
in the first outer iteration. After the first outer iteration, 
it is thus recommended to continue with the (j + l)th 
outer iteration such that (j + 1) > r/z. 

Another modification that could improve the perfor- 
mance of the JN algorithm is to run a small number 
of iterations of damped log belief propagation on the 
same parity check matrix. Although belief propagation 
is not exact due to the cycles in the associated Tanner 
graph, running a very small number of iterations of belief 
propagation is very effective [29]. Observing that the 
complexity of belief propagation is much lower than 
that of Gaussian elimination, one gets a performance 
enhancement at a slightly increased complexity. 

Throughout the remaining of this paper, we will 
refer to the modified JN algorithm with a list decoding 
stopping criterion, as well as with the other modifications 
introduced in this section, by ABP-BM if the decoding 
algorithm D is BM. Similarly, if the decoding algorithm 
was HD, the algorithm is referred to by ABP-HD. One 
of the main contribution in this paper, the utilization 
of the a-posteriori probabilities at the output of the 
ABP algorithm as the soft information input to an ASD 
algorithm, is presented in the following section. 

VI. The Hybrid ABP-ASD List Decoding 
Algorithm 

Koetter and Vardy [9] point out that it is hard to 
maximize the mean of the score with respect to the 
to the true channel a-posteriori probabilities. Previous 



multiplicity assignment algorithms [9]— [ 11] assumed 
approximate a-posteriori probabilities. The problem is 
simplified by assuming that the transmitted codeword is 
drawn uniformly from F™. Also, the n received symbols 
are assumed to be independent and thus be assumed to 
be uniformly distributed. In such a case, the a-posteriori 
probabilities are approximated to be a scaling of the 
channel transition probabilities, 

Pr{vi\ui = 0} 



nf(/3) 



J2ueF a Pr{Vi\ui =u} ' 



(7) 



However, from the maximum distance separable (MDS) 
property of RS codes any k symbols (only) are /c-wise 
independent and could be treated as information symbols 
and thus uniformly distributed. Thus these assumptions 
are more valid for higher rate codes and for memoryless 
channels. It is well known that belief propagation algo- 
rithms improve the reliability of the symbols by taking 
into account the geometry of the code and the correlation 
between symbols (see for example [27].) Due to the 
dense nature of the parity check matrix of the binary 
image of RS codes, running belief propagation directly 
will not result in a good performance. Because the 
Tanner graph associated with the parity check matrix of 
the binary image of RS codes has cycles, the marginals 
passed by the (log) belief propagation algorithm are 
no longer independent and the information starts to 
propagate in the loops. 

Jiang and Narayanan [14] proposed a solution to this 
problem by adapting the parity check matrix after each 
iteration. When updating the check node reliabilities 
Ri.j (see (0) corresponding to a pivot in a single 
weight column, the information Qi j from any of the 
least reliable independent bits does not enter into the 
summation. One reason for the success of ABP is that 
the reliabilities of the least reliable bits are updated by 
only passing the information from the more reliable bits 
to them. An analytical model for belief propagation on 
adaptive parity check matrices was recently proposed 
[30]. 

Our ABP-ASD algorithm is summarized by the fol- 
lowing chain, 



u 



(8) 



ASD 



where u is the transmitted codeword, A is a multiplicity 
assignment algorithm, M is the multiplicity matrix and 
u is the decoder output. In particular, the ABP-ASD list 
decoder is implemented by deploying the list decoder 
stopping criterion, proposed in the previous section, with 
an ASD decoding algorithm D (see Alg. |2j: 

• ASD: Using A p generate anijxn reliability matrix 
II which is then used as an input to an multiplicity 
assignment algorithm to generate multiplicities ac- 
cording to the required interpolation cost. This mul- 
tiplicity matrix is passed to the (modified) GS list 
decoding algorithm. If the generated codeword list 
is not empty, the list of codewords is augmented to 
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Iterative ASD of (15,11) RS code BPSK modulated over an AWGN channel 
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Fig. 1. The performance of iterative ASD of (15,1 1) RS code, which is 
BPSK modulated and transmitted over an AWGN channel, is compared 
to that of other ASD algorithms and ABP-BM list decoding 

the global list of codewords. If only one codeword 
is required, the codeword with the highest reliability 
with respect to the channel LLR's A ch is added to 
the global list. 
In this paper, the KV algorithm is used as the mul- 
tiplicity assignment scheme. More efficient but more 
complex MA schemes could also be used [11]. The joint 
ABP-ASD algorithm corrects decoder failures (the re- 
ceived word does not lie in the decoding region centered 
around any codeword) of the ASD decoder D, by iter- 
atively enhancing the reliabilities of the received word, 
and thus moving the received word into the decoding 
region around a certain codeword. The decoding region 
in turn depends on the algorithm D and the designed 
interpolation cost. Furthermore, it attempts to eliminate 
decoder errors (the decoded codeword is not the trans- 
mitted codeword) by iteratively adding codewords to the 
global list of codewords and choosing the most probable 
one. 

Since ASD is inherently a list decoding algorithm 
with a larger decoding region, it is expected that ABP- 
ASD outperforms ABP-HD and ABP-BM. Since our 
algorithm transforms the channel LLRs into interpolation 
multiplicities for the GS algorithm, then, by definition, 
it is an interpolation multiplicity assignment algorithm 
for ASD. 

The ABP-ASD algorithm has a polynomial-time com- 
plexity. The ABP step involves o(n 2 ) floating point 
operations, for sorting and BP, and o(min(k 2 ,f 2 ) n) 
binary operations for GE [13]. As for ASD, the KV MA 
algorithm (see @) has a time complexity of 0(n 2 ). An 
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Fig. 2. The performance of iterative ASD of (15,11) RS code, which 
is BPSK modulated and transmitted over an AWGN channel, is shown 
for a finite interpolation cost of 10 3 and different iteration numbers. 

efficient algorithm for solving the interpolation problem 
is Koetter's algorithm [23] with a time complexity of 
0(n 2 A 4 ). A reduced complexity interpolation algorithm 
is given in [24]. Roth and Ruckenstein [31] proposed an 
efficient factorization algorithm with a time complexity 
0((l log 2 l)k(n + llogq)), where I is an upper bound on 
the ASD's list size and is determined by A. 

VII. A Low Complexity ABP Algorithm 

Most of the complexity of adaptive belief propagation 
lies in row reducing the binary parity check matrix (after 
rearranging the columns according to the permutation 
P). To reduce the complexity one could make use of the 
columns already reduced in the previous iteration. 

We will use the same notation as in Alg. 2 with 
a subscript j to denote the values at iteration j. For 
example, the vector of sorted LLRs at the jth iteration 
is A*™. Define Pj(H) to be the matrix obtained when 
the columns of the parity check matrix H are permuted 
according to the permutation Pj at the jth iteration. 
GE(H) will be the reduced matrix (with an identity 
submatrix) after Gaussian elimination is carried out on 
the matrix H. 

Let Rj = {t : tth column of H was reduced to 
a column of unit weight in GE(Pj(H))}. It is clear 
that the cardinality of Rj is f. Now assume that log 
BP is run and that the LLRs are updated and inverse 
permuted to get A^ (step 5 in Alg. 2). The set of indices 
of the r (independent) LLRs in A^ with the smallest 
magnitude will be denoted by Sj+i. By definition, Pj+i 
is the permutation that sorts the LLRs in A?? in ascending 
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Iterative ASD of (31 ,25) RS code BPSK modulated over AWGN Channel 




SNR (dB) 

Fig. 3. ABP-ASD list decoding of the (31,25) RS code transmitted 
over an AWGN with BPSK modulation. 



order according to their magnitude to get AV^. The 
set C/j+i = Rj H Sj+i is thus the set of indices of bits 
which are among the least reliable independent bits at the 
(j + l)th iteration and whose corresponding columns in 
the reduced parity check matrix at the previous iteration 
were in the identity submatrix. 

The algorithm is modified such that GE will be run 
on the matrix whose left most columns are those corre- 
sponding to Uj+i. To construct the identity submatrix, 
these columns may only require row permutations for 
arranging the pivots (ones) on the diagonal. Note that 
these permutations may have also been required when 
running GE on Pj + i(H). Only a small fraction of 
the columns will need to be reduced to unit weight 
leading to a large reduction in the GE computational 
complexity. Also note that what matters is that a column 
corresponding to a bit with low reliability lies in the 
identity (sparse) submatrix and not its position within 
the submatrix. This is justified by the fact that the 
update rules for all the LLRs corresponding to columns 
in the identity submatrix are the same. Thus provided 
that the first r columns in Pj + i(H) are independent, 
changing their order does not alter the performance of 
the ABP algorithm. To summarize the proposed reduced 
complexity ABP algorithm can be stated as follows: 

Algorithm 3: Low Complexity Adaptive Belief Prop- 
agation 

Initialization: A p := A ch , j = 1 
DO 

Uj = l 

Proceed as in the first iteration of Alg. 2; A™ = 

A in \ Alg . 2 , Pi = P\m b .2, fli = H p \ai b .2 and A? = 



If j>l 

1) Sort the updated LLR vector A| —1 in ascending 
order of the magnitude of its elements. Let Wj be 
the associated sorting permutation matrix. 

2) Rearrange the columns of the binary parity check 
matrix Hj-i to form a new matrix 

Q'j = Wjfa-!). 

3) Rearrange the most left f columns of the binary 
parity check matrix Q'j such that the columns of 
unit weight are the most left columns. Let W" be 
the corresponding permutation matrix. (This could 
be done by sorting the first f columns of Q'j in 
ascending order according to their weight.) Let the 
resulting matrix be 

g? = wf(05). 

4) Permute the LLR vector; 



where Pj = W<W<'. 

5) Update the (global) permutation matrix; 

Pj = PjPj-i- 

6) Run Gaussian elimination on the matrix Q'! from 
left to right; 

Aj = GE(Q'!). 

7) Run damped LBP on Hj with initial LLRs A*. n 
for Hr iterations. The output vector of extrinsic 
LLRs is AJ. 

8) Update the LLRs; 

A] = Af + ai A? and Aj = P^A]. 

9) Decode using Aj as an the input to the decoding 
algorithm D, 

10) Increment j. 

While Stopping criterion not satisfied. 

The algorithm as described above iteratively updates a 
global permutation matrix and avoids inverse permuting 
the row-reduced parity check matrix in each iteration. 
The implementation of the algorithm also assumes for 
simplicity that the columns in the parity check matrix 
corresponding to the f least reliable bits are independent 
and could therefore be reduced to unit weight columns. 
It is also noticed that in practice the cardinality of 
Uj+i is close to f which means that the GE elimination 
complexity will be significant only in the first iteration. 

We will assume the favorable condition in which 
the most left r columns of an parity check matrix are 
independent. Taking into account that the parity check 
matrix is a binary matrix, the maximum number of 
binary operations required to reduce the first f columns 
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Required Iterations for Successful Decoding of (31,25) RS Code. SNR=3.5 dB 
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Fig. 4. This histogram shows the percentage of transmitted code- 
words successfully decoded versus the iteration number at which the 
transmitted codeword was first successfully added to the ABP-ASD list 
with ATI = 20 and TV 2 = 10. The (31, 25) RS code is transmitted 
over an AWGN channel at an SNR of 3.5 dB. 



to an identity submatrix in the JN algorithm (Alg. 2) can 
be shown to be 



e 



GE 



2V(f-a)(n-a + l) < f 2 n-fk. (9) 



(It is assumed that the two GE steps, elimination and 
back substitution, are symmetric). Row permutation op- 
erations were neglected. Now assume that the cardinality 
of Uj+i is Sr, where 5 < 1. For the modified algorithm, 
only row permutations may be required for the first 
Sf columns to arrange the pivots on the diagonal of 
the identity submatrix. These permutations may also be 
required for the JN algorithm. Then the relative reduction 
in complexity is 

<d GE in Alg. 2 - <d GE in Alg. 3 
Qge in Alg. 2 



T,J=i(r - a)(n - a + I) 

Ea=i(r-a)(ra-a+ 1 ) 
(r 2 h)(2S - S 2 ) -Sfk 



rk 



2S-S Z 



(10) 



For example, if we assume that on average 5 = 0.5, 
a simple calculation for the (255, 239) code over i<256 
shows that the relative reduction in the complexity of the 
GE step is about 75%. In practice S is close to one. Note 
that Alg. 3 does require sorting f columns of Q'j (see 
step (|3}) according to their weight but the complexity is 
relatively small. 



VIII. Numerical Results and Discussions 

In the next subsection, a fast simulation setup is 
described for ABP list decoding. Bounds on the error 
probability of the ML decoder are then discussed. We 
then show simulation results for our algorithm. 

A. Fast Simulation Setup 

We describe a fast simulation setup for ABP with a 
list decoding stopping criterion. One could avoid running 
the actual decoder D at each iteration and instead check 
whether the transmitted codeword is on the list generated 
by the decoder D. The stopping criterion would be 
modified such that the iterative decoding stops if the 
transmitted codeword is on the list or if the maximum 
number of iterations is reached. A decoding success is 
signaled if the transmitted codeword is on the list. 

It is easy to see that this simulation setup is equivalent 
to running the actual ABP list decoder for the maximum 
number of iterations. Suppose that the received sequence 
results in an maximum likelihood (ML) error, then it is 
very unlikely that the decoder D will correctly decode 
the received word at any iteration. In case of an ML 
decoder success and the transmitted codeword is added 
to the global list at a certain iteration, which presumably 
could be checked, then it would be the closest codeword 
to the received word and thus the list decoder's choice. 
Thus for a fast implementation, a decoding success is 
signaled and iteration stops once the transmitted code- 
word appears on the global list. 

In case that D is a bounded minimum distance decoder 
such as the Berlekamp-Massey (BM) algorithm, the 
transmitted codeword would be on the global list if it 
is at a Hamming distance of < L^r^J from the hard- 
decisioned (modified) LLRs. If D is an ASD algorithm 
that assigns the multiplicity matrix M, the transmitted 
codeword is on the ASD's list (and thus the global 
list) if it satisfies the sufficient conditions of (0 and 
for finite and infinite interpolation costs respectively. 
It was shown in [9], that simulating the KV algorithm 
by checking the sufficient condition of (|2ji results in 
accurate results. This is partially justified by the fact 
that on average, the ASD's list size is one [32]. This 
is also justified by observing that if the ASD's list is 
empty (a decoding failure), the condition will not 
be satisfied. However, if the list is nonempty but the 
transmitted codeword is not on the list (a decoding 
error), the condition will still not be satisfied for the 
transmitted codeword and a decoding error/failure is 
signaled. However if the condition is satisfied, then this 
implies that the transmitted codeword is on the ASD's 
list and thus a decoding success. 

B. Bounds on the ML error probability 

As important as it is to compare our algorithms with 
other algorithms, it is even more important to compare it 
with the ultimate performance limits, which is that of the 



x 



soft decision ML decoder. When transmitting the binary 
image of RS codes over a channel, the performance of 
the maximum likelihood decoder depends on the weight 
enumerator of the transmitted binary image. The binary 
image of RS codes is not unique, but depends on the 
basis used to represent the symbols as bits. An average 
binary weight enumerator of RS codes could be derived 
by assuming a binomial distribution of the bits in a 
non-zero symbol [33]. Based on the Poltyrev tangential 
sphere bound (TSB) [34] and the average binary weight 
enumerator, average bounds on the ML error probability 
of RS codes over additive white Gaussian noise (AWGN) 
channels were developed in [33] and were shown to 
be tight. We will refer to this bound by ML-TSB. 
Alternatively the averaged binary weight enumerator 
could be used in conjunction with other tight bounds 
such as the Divsalar simple bound [35] to bound the 
ML error probability. 

C. Numerical Results 

In this subsection, we give some simulation results 
for our algorithm. As noted before, the multiplicity 
assignment algorithm used for ABP-ASD in the these 
simulations is the KV algorithm. N2 denotes the number 
of outer iterations (parallel decoders) and Nl is the num- 
ber of inner iterations in each of these outer iterations. 

1) (15,11) RS code over an AWGN channel: A 
standard binary input AWGN channel is assumed where 
the transmitted codewords are BPSK modulated. In Fig. 
1, we compare the performance of different decoding 
algorithms. HD-BM refers to the performance of a hard 
decision bounded minimum distance decoder such as the 
BM algorithm. The ABP-BM list decoding algorithm 
with A^l = 5 iterations and one iteration of LBP on 
each parity check matrix, Itn — 1 (see step 4 in Alg. 
13 has a coding gain of about 2.5 dB over HD-BM 
at a codeword error rate (CER) of 10 -6 . Increasing 
the number of iterations to Nl = 20 iterations, we 
get a slightly better performance. JN-BM refers to the 
JN algorithm with the JN stopping criterion and a BM 
decoder. Due to the high decoder error probability of the 
(15, 11) code, ABP-BM, with the list decoder stopping 
criterion, yields a much better performance than JN-BM. 
The ABP-ASD list decoding algorithm outperforms all 
the previous algorithms with only 5 ABP iterations and 
with Itn — 3. Comparing its performance with soft 
decision ML decoding of the RS code, we see that ABP- 
ASD has a near ML performance with a performance 
gain of about 3 dB over HD-BM at a CER of 1(T 6 . (ML 
decoding was carried out by running the BCJR algorithm 
on the trellis associated with the binary parity check 
matrix of the RS code [36].) Moreover, the averaged 
TSB on the ML codeword error probability is shown to 
confirm that it is a tight upper bound and that the ABP- 
ASD algorithm is near optimal for this code. 

The performance of different ASD algorithms are 
compared for infinite interpolation costs, the KV al- 
gorithm [9], the Gaussian approximation (Gauss) [10] 
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Fig. 5. The performance of the ABP-ASD decoding of the (31, 25) 
RS code over a Rayieigh fading channel with AWGN when the channel 
information is unknown at the decoder. 

and the Chernoff bound algorithm (Chernoff) [11]. It is 
noted that the Chernoff bound algorithm has the best 
performance, especially at the tail of error probability. It 
is also interesting to compare the performance of ABP- 
ASD with other ASD MA algorithms. It has about 2 dB 
coding gain over the KV algorithm at a CER of 10~ 6 . 
As expected, the Chernoff method has a comparable 
performance at the tail of the error probability. 

The ABP algorithm used in the simulations shown in 
Fig. 1 is Alg. 2. The performance of Alg. 3 was identical 
to that of Alg. 2. However, the complexity is much less. 
The average S (see i ll Oi l) averaged over all iterations was 
calculated versus the SNR. It was observed that the ratio 
of the number of columns to be reduced in Alg. 3 to that 
in Alg. 2 is about 0.1 (5 = 0.9). This gives about a 99% 
reduction in the Gaussian elimination complexity. Thus 
only the first iteration or restart suffers from an Gaussian 
elimination complexity if Alg. 3 is used. 

Near ML decoding for the same code is also achieved 
by the ABP-ASD algorithm with a finite cost of 10 3 
as shown in Fig. 2. Comparisons are made between the 
possible coding gains if the number of iterations is lim- 
ited to Nl = 1. 2, 5. With 5 iterations, the performance 
gain over the KV algorithm, with the same interpolation 
cost, is nearly 1.8 dB at a CER of 10 -5 . Comparing the 
ABP-ASD performance to that of Fig. 1, with infinite 
interpolation costs, we observe that a small loss in 
performance results with reasonable finite interpolation 
costs. Unless otherwise stated, the remaining simulations 
in this paper will assume infinite interpolation costs to 
show the potential of our algorithm. 
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It is to be noted that in simulating the ABP-BM list 
decoder, the simulations using a real BM decoder were 
identical to the simulations using the fast simulation 
setup described in this section. To save simulation time, 
the curves shown here for ABP-ASD are generated using 
the fast simulation setup. As is the case for ABP-BM, 
running the real ABP-ASD decoder will yield the same 
results. 

2) (31,25) RS code over AWGN channel: The argu- 
ments for the (15,11) RS code also carry over for the 
(31,25) RS code when BPSK modulated and transmitted 
over an AWGN channel, as shown in Fig. 3. With 
only 5 iterations, the ABP-BM list decoding algorithm 
outperforms previous ASD algorithms. The performance 
of ABP-ASD with 20 inner iterations (Nl) and 10 outer 
iterations (N2) is better than the ML upper bound and has 
more than 3 dB coding gain over the BM algorithm at an 
CER of 10~ 4 . A favorable performance is also obtained 
by only 3 restarts (outer iterations). By comparing with 
Fig. 2 of [18], our ABP-ASD algorithm has about 1.6 
dB gain over the combined Chase II-GMD algorithm at 
an CER of 10" 4 . 

To show the effectiveness of the restarts or outer itera- 
tions, we kept track of the iteration number at which the 
ABP-ASD list decoder was first capable to successfully 
decode the received word. In other words, this is the 
iteration when the transmitted codeword was first added 
to the ABP-ASD list. The percentage of transmitted 
codewords which were first successfully decoded at a 
certain iteration is plotted versus the iteration number 
in the histogram of Fig. 4. This is shown at a signal 
to noise ratio (SNR) of 3.5 dB and for Nl = 20 
N2 = 10 with a total of 200 iterations. At the beginning 
of each restart (every 20 iterations) there is a boost in 
the number of codewords successfully decoded and this 
number declines again with increasing iterations. The 
zeroth iteration corresponds to the KV algorithm. This 
histogram is also invaluable for decoder design and could 
aid one to determine the designed number of iterations 
for a required CER. 

3) (31, 25) RS code over a Rayleigh Fading Channel: 
As expected from the discussion in Sec. I VII the coding 
gain of ABP-ASD is much more if the underlying chan- 
nel model is not memoryless. This is demonstrated in 
Fig. 5, where an (31, 25) code is BPSK modulated over a 
relatively fast Rayleigh fading channel with AWGN. The 
Doppler frequency is equal to 50 Hz and the codeword 
duration is 0.02 seconds. The coding gain of ABP-ASD 
over the KV algorithm at an CER of 10~ 4 is nearly 5 
dB when the channel is unknown to both decoders. 

4) (255, 239) RS code over AWGN channel: The per- 
formance of the ABP-ASD algorithm is also investigated 
for relatively long codes. The (255, 239) code and its 
shortened version, the (204, 188) code, are standards in 
many communication systems. The performance of the 
(255, 239) code over an AWGN channel is shown in 
Fig. 6. By 20 iterations of ABP-BM, one could achieve 
a coding gain of about 0.5 dB over the KV algorithm. 
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Fig. 6. The performance of the ABP-ASD decoding of the (255, 239) 
RS code over an AWGN channel with BPSK modulation. 

At an CER of 10~ 6 , after a total of 25 outer iterations 
(restarts), the coding gain of ABP-ASD over BM is about 
1.5 dB. An extra 0.1 dB of coding gain is obtained with 
25 more outer iterations. Moreover, the performance of 
the ABP-ASD decoder is within 1 dB of the averaged 
ML TSB. 

5) (31, 15) RS code over AWGN channel: The per- 
formance of our algorithm is studied for the (31, 15) RS 
code over an AWGN channel. The rate of this code is 
0.48. Because this code is of relatively low rate, the HD- 
GS algorithm does improve over the HD-BM bounded 
minimum distance decoding algorithm. As seen from 
Fig. 7, ML soft-decision decoding offers about 4 dB 
coding gain over the hard decision GS algorithm and 
about 2.8 dB coding gain over the soft decision KV 
ASD algorithm at an CER of 10~ 5 . With 20 iterations, 
ABP-BM list decoding improves over the KV algorithm. 
As expected, ABP-ASD has a better performance for 
the same number of iterations. With 10 restarts, ABP- 
ASD has a reasonable performance with about a 3 dB 
coding gain over the BM algorithm. Another 0.5 dB of 
coding gain could be achieved by increasing the number 
of iterations. 

6) General Observations: It is noticed that the coding 
gain between iterations decreases with the number of 
iterations. It is also to be noted that the ABP-ASD 
list decoder requires running the KV ASD algorithm in 
each iteration. Running a number of 'plain-vanilla' ABP 
iterations without the ASD decoder and then decoding 
using the ASD decoder (to reduce the complexity) will 
yield a worse performance for the same number of 
iterations. The same arguments also hold for the ABP- 
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Fig. 7. ABP-ASD list decoding of the (31,15) RS code, of rate 0.48, 
transmitted over an AWGN with BPSK modulation. 

BM list decoding. A reasonable performance is achieved 
by ABP-BM list decoding. By deploying the KV ASD 
algorithm, ABP-ASD list decoding has significant cod- 
ing gains over the KV ASD algorithm and other well 
known soft-decision decoding algorithms. 

IX. Conclusions 

In this paper, we proposed a list decoding algorithm 
for soft-decision decoding of Reed-Solomon codes. Our 
algorithm is based on enhancing the soft reliability 
channel information before passing them to an algebraic 
soft-decision decoding algorithm. This was achieved by 
deploying the Jiang and Narayanan algorithm, which 
runs belief-propagation on an adapted parity check ma- 
trix. Using the Koetter-Vardy algorithm as the alge- 
braic soft-decision decoding algorithm, our algorithm 
has impressive coding gains over previously known soft- 
decision decoding algorithms for RS codes. By com- 
paring with averaged bounds on the performance of 
maximum likelihood decoding of RS codes, we observe 
that our algorithm achieves a near optimal performance 
for relatively short, high-rate codes. We introduced some 
modifications over the JN algorithm that resulted in 
better coding gains. We presented a low complexity 
adaptive belief-propagation algorithm, which results in 
a significant reduction in the computational complexity. 
The performance of our algorithm was studied for the 
cases when the interpolation cost of the algebraic soft- 
decision decoding algorithm is both finite and infinite. A 
small loss in coding gain results when using manageable 
interpolation costs. The coding gain of the presented 
algorithm is larger for channels with memory. Our pro- 
posed algorithm could also be viewed as an interpolation 



multiplicity assignment algorithm for algebraic-soft de- 
coding. 

The question remains whether the JN algorithm is the 
optimum way to process the channel reliabilities before 
algebraic soft-decision decoding. The KV algorithm was 
our ASD decoder of choice due to its low complexity. 
Further investigations would be required to determine the 
best ASD algorithm or, in general, soft-decision decod- 
ing algorithm for joint belief-propagation list-decoding 
with an eye on both the performance and computational 
complexity. 
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